



48042b1dae4950fef2bd2aafa0b971a1-AuthorFeedback.pdf

Neural Information Processing Systems

We then define a surrogate loss L_toy(P ∈ R^D) for a network configuration P in this weight space, which we choose to depend monotonically on the L2 distance to the nearest n-wedge. Together, these define locally an n-dimensional hyperplane of finite thickness in the remaining D − n thin directions, i.e. a cuboid. To go beyond classification, we also looked at CNN-based autoencoders. In all cases the results supported our landscape model and we will include them in the final version. R5: Radial tunnels = what low-dimensional cuts would show.
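The surrogate loss described above can be sketched in a few lines. This is a hypothetical illustration, not the authors' code: `toy_loss`, the wedge basis, and the `thickness` parameter are all assumptions chosen to match the verbal definition (a loss that depends monotonically on the L2 distance to an n-dimensional wedge, flat inside a cuboid of finite thickness).

```python
import numpy as np

def toy_loss(p, wedge_basis, thickness=0.1):
    """Hypothetical sketch of the surrogate loss L_toy(P in R^D):
    monotone in the L2 distance from p to the n-dimensional wedge
    spanned by `wedge_basis`, thickened into a cuboid."""
    # Orthonormalize the wedge directions, then project p onto them.
    basis, _ = np.linalg.qr(wedge_basis.T)      # (D, n), orthonormal columns
    p_parallel = basis @ (basis.T @ p)          # component inside the wedge
    dist = np.linalg.norm(p - p_parallel)       # L2 distance to the wedge
    # Any monotone function of the distance would do; inside the cuboid
    # (dist < thickness) the loss is zero and flat.
    return max(0.0, dist - thickness)

rng = np.random.default_rng(0)
D, n = 10, 3
wedge = rng.normal(size=(n, D))                 # n vectors spanning the wedge
point_on_wedge = wedge.T @ rng.normal(size=n)   # a point inside the wedge
print(toy_loss(point_on_wedge, wedge))          # zero: inside the cuboid
print(toy_loss(rng.normal(size=D), wedge))      # positive: off the wedge
```

Any strictly increasing function of the distance would give the same low-loss set; the hinge form above is just the simplest choice.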



A common point you brought up


Thank you very much for your detailed reviews and comments. The simplest version of our toy landscape is constructed as follows. As such, our toy model serves us well, albeit it doesn't […]. In real nets, we find a large number of weight-space directions in which we can move very far while the loss doesn't increase. We find the full low-loss manifold to be a union of those in different directions and orientations. We will include this extended discussion in the paper.
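The flat-direction claim above can be made concrete in the toy landscape. The following sketch is an assumption-laden illustration (the wedge, `thickness`, and `loss` are hypothetical, not the paper's experiments): along directions lying inside the wedge the loss stays zero however far we move, while a generic direction leaves the low-loss cuboid almost immediately.

```python
import numpy as np

rng = np.random.default_rng(1)
D, n, thickness = 20, 5, 0.1
# Orthonormal basis of an n-dimensional wedge through the origin.
basis, _ = np.linalg.qr(rng.normal(size=(D, n)))

def loss(p):
    residual = p - basis @ (basis.T @ p)        # component off the wedge
    return max(0.0, np.linalg.norm(residual) - thickness)

flat_dir = basis @ rng.normal(size=n)           # direction inside the wedge
flat_dir /= np.linalg.norm(flat_dir)
rand_dir = rng.normal(size=D)                   # generic direction
rand_dir /= np.linalg.norm(rand_dir)

for t in (1.0, 100.0, 10000.0):
    # Loss stays 0 along flat_dir for arbitrarily large t,
    # but grows linearly along rand_dir.
    print(t, loss(t * flat_dir), loss(t * rand_dir))
```

In D dimensions a random direction has, with probability one, a nonzero component orthogonal to the n-dimensional wedge, so only the specially chosen in-wedge directions stay flat.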


Large Scale Structure of Neural Network Loss Landscapes

Fort, Stanislav, Jastrzebski, Stanislaw

arXiv.org Machine Learning

There are many surprising and perhaps counter-intuitive properties of the optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High dimensionality plays a key role in our model. Our core idea is to model the loss landscape as a set of high-dimensional wedges that together form a large-scale, interconnected structure towards which optimization is drawn. We first show that hyperparameter choices such as learning rate, network width and L2 regularization affect the path the optimizer takes through the landscape in similar ways, influencing the large-scale curvature of the regions the optimizer explores. Further, we predict and demonstrate new counter-intuitive properties of the loss landscape. We show the existence of low-loss subspaces connecting a set (not only a pair) of solutions, and verify it experimentally. Finally, we analyze recently popular ensembling techniques for deep networks in the light of our model.
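The "low-loss subspace connecting a set of solutions" claim has a simple analogue in a linear toy wedge. This is a hedged sketch under that assumption (the wedge and `loss` below are illustrative stand-ins, not the paper's trained networks): if several solutions all lie on one wedge, every point of the simplex they span is also low-loss, not just the segment between a single pair.

```python
import numpy as np

rng = np.random.default_rng(2)
D, n = 30, 8
# Orthonormal basis of the low-loss wedge (the toy stand-in for the
# low-loss manifold found by training).
basis, _ = np.linalg.qr(rng.normal(size=(D, n)))

def loss(p, thickness=0.1):
    residual = p - basis @ (basis.T @ p)
    return max(0.0, np.linalg.norm(residual) - thickness)

# Four "independently trained" solutions, all lying on the same wedge.
solutions = [basis @ rng.normal(size=n) for _ in range(4)]
# A random point of the simplex spanned by all four solutions.
weights = rng.dirichlet(np.ones(4))
mix = sum(w * s for w, s in zip(weights, solutions))
print(loss(mix))   # zero: the whole simplex stays low-loss
```

The key point is linearity: a convex combination of points in a linear subspace stays in that subspace, so the entire simplex between the solutions, not only pairwise paths, sits inside the low-loss set.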